Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding
Authors
Abstract
Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from the usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to be complex and computationally intense, and may be difficult to deploy in a practical video coding pipeline. This work focuses on reducing the complexity of such methodologies, to design a set of simplified and cost-effective attention-based architectures for chroma intra-prediction. A novel size-agnostic multi-model approach is proposed to reduce the complexity of the inference process. The resulting simplified architecture is still capable of outperforming state-of-the-art methods. Moreover, a collection of simplifications is presented in this paper, to further reduce the complexity overhead of the proposed prediction architecture. Thanks to these simplifications, a reduction in the number of parameters of around 90% is achieved with respect to the original attention-based methodologies. Simplifications include a framework for reducing the overhead of the convolutional operations, a simplified cross-component processing model integrated into the original architecture, and a methodology to perform integer-precision approximations with the aim to obtain fast and hardware-aware implementations. The proposed schemes are integrated into the Versatile Video Coding (VVC) prediction pipeline, retaining the compression efficiency of state-of-the-art chroma intra-prediction methods based on neural networks, while offering different directions for significantly reducing coding complexity.
Similar resources
Intra-prediction for Video Coding with Neural Networks
Intra-prediction is a method for coding standalone frames in video coding. Until now, this has mainly been done using linear formulae. Using an Artificial Neural Network (ANN) may improve the prediction accuracy, leading to improved coding efficiency. In this degree project, Fully Connected Networks (FCN) and Convolutional Neural Networks (CNN) were used for intra-prediction. Experiments were done...
Multiple Line-based Intra Prediction for High Efficiency Video Coding
Traditional intra prediction usually utilizes the nearest reference line to generate the predicted block when considering strong spatial correlation. However, this kind of single line-based method does not always work well due to at least two issues. One is the incoherence caused by signal noise or the texture of another object, where this texture deviates from the inherent texture of the cur...
Regression-based Intra-prediction for Image and Video Coding
By utilizing previously known areas in an image, intra-prediction techniques can find a good estimate of the current block. This allows the encoder to store only the error between the original block and the generated estimate, thus leading to an improvement in coding efficiency. Standards such as AVC and HEVC describe expert-designed prediction modes operating in certain angular orientations al...
Derivation for Adaptive Scan of Intra Prediction in Video Coding
H.264 [1] and AVS [2] are the emerging state-of-the-art video coding standards, in which the block-based hybrid coding framework is employed. The DCT, or an approximation of it such as the Integer Cosine Transform (ICT), is the centerpiece for signal decorrelation and energy compaction. After the DCT, the transform coefficients are quantized and scanned into one-dimensional (1-D) signals, named run-level pairs. The k...
Spatial prediction based intra-coding
According to the H.264 video coding standard, spatial prediction is used for intra block coding. The luma prediction may be based on 4×4 blocks, for which there are nine prediction modes, or on 16×16 macroblocks, for which there are four prediction modes. For chroma prediction, there are also four prediction modes. In this paper, a new method is proposed for improving the intra prediction algorithm, the first st...
Journal
Journal title: IEEE Journal of Selected Topics in Signal Processing
Year: 2021
ISSN: 1941-0484, 1932-4553
DOI: https://doi.org/10.1109/jstsp.2020.3044482